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Abstract 

A key measure that has been used extensively in analyzing complex networks 
is the degree of a node (the number of the node's neighbors). Because of its dis- 
crete nature, when the degree measure was used in analyzing weighted networks, 
weights were either ignored or thresholded in order to retain or disregard an edge. 
Therefore, despite its popularity, the degree measure fails to capture the disparity 
of interaction between a node and its neighbors. 

We introduce in this paper a generalization of the degree measure that ad- 
dresses this limitation: the continuous node degree (C-degree). We prove that 
in general the C-degree reflects how many neighbors are effectively being used 
(taking interaction disparity into account) and if a node interacts uniformly with 
its neighbors (no interaction disparity) the C-degree of the node becomes identi- 
cal to the node's (discrete) degree. We analyze four real-world weighted networks 
using the new measure and show that the C-degree distribution follows the power- 
law, similar to the traditional degree distribution, but with steeper decline. We also 
show that the ratio between the C-degree and the (discrete) degree follows a pattern 
that is common in the four studied networks. 

1 Introduction 

Network analysis is an interdisciplinary field of research that spans over biology, chem- 
istry, computer science, sociology, and others. A key measurement that has been used 
extensively in analyzing networks is the degree of a node. A node's degree is the 
number of edges incident to that node. Intuitively, the degree of a node reflects how 
connected the node is. This simple measure (along with other network measures) al- 
lowed the discovery of universal patterns in networks, such as the power law of the 
degree distribution [3] [8] . 

One of the limitations of the degree measure is that it ignores any disparity in the 
interaction between a node and its neighbors. In other words, the degree measure 
assumes uniform interaction across each node's neighbors. This can result in giving an 
incorrect perception of the effective node degree. For example, a person may have 10 or 
more acquaintances but mainly interacts with only two of them (friends). Should that 
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person be considered 2 times more connected than a person with only 5 acquaintances 
but also interacting primarily with two of them? 

Several network measures were proposed to analyze weighted networks (2] |4] [5] 
[Toll , where an edge's weight quantifies the amount of interaction over the edge. How- 
ever, none of the previously developed measures is a proper generalization of the de- 
gree measure. A proper generalization of the degree measure that captures the dis- 
parity of interactions needs to satisfy three properties. The first property is preserving 
the maximum traditional degree: if all weights incident to a node are equal (maximum 
utilization of neighbors), then the generalized degree is maximum and should be equal 
to the traditional (discrete) degree. The second property is preserving the minimum 
traditional degree: if all edges incident to a node have weights that are almost zero 
except one edge that has a weight much larger than zero (the node interacts primarily 
with one neighbor) then the generalized degree should be very close to 1. The third 
and final property is the consistent handling of disparity: the partial order imposed by 
the generalized degree on any two nodes needs to be consistent with the previous two 
properties. Intuitively, this means the more equal the weights are, the higher their gen- 
eralized degree should be. We formalize these properties into axioms in the following 
section. 

A generalization of the degree measure is significant because it bridges the gap be- 
tween the extensive research made using the degree (which ignored weights) and the 
research on weighted networks. Furthermore, it allows more accurate analysis of the 
networks that were previously analyzed using the degree measure. For example, it is 
known that the degree distribution of the Internet follows the power law [8 |. How- 
ever, if one takes the disparity of interactions into account, does the effective degree 
distribution of the Internet still follow a power law? 

We introduce in this paper a new measure for analyzing weighted networks: the 
continuous degree ( C-degree). What sets our measure apart from previous work is that 
it is a continuous generalization of the degree measure that captures the disparity of 
interaction. In particular, we prove that if every node interacts with all its neighbors 
equally, then the C-degree becomes identical to traditional (discrete) degree measure 
of the same node. However, if there is a disparity in a node's interaction with its 
neighbors, then the C-degree will capture such disparity, unlike the traditional degree 
measure. 

An implicit assumption for using our measure is that network weights quantify 
the amount of interaction, therefore all the weights are positive. We analyze four real 
world networks using the new measure and show that the C-degree distribution still 
follows the power law (but with a steeper power coefficient than the discrete degree 
distribution). We also show that the ratio between the C-degree and the traditional 
degree is bounded. 

2 Background 

A network (graph) is defined as N = (V, E), where V is the set of network nodes 
(vertices) and E is the set of edges (links) connecting these nodes. The degree of a 
node d € Vis k(v) = \E(v)\, where |-E(i>)| is the number of edges incident to node 
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v. The degree distribution P(k) measures the frequency of a particular degree k in 
a network: P(k = u) = \{v : v £ V A = The degree distribution is a 
common method for combining the degrees of all network nodes into one measure that 
summarizes and characterizes the network. 

This paper focuses on weighted networks where the weight of an edge w(e) > e 
quantifies the amount of interaction across the edge e and e is a small constant greater 
than (and close to) zero. We call networks that satisfy this property interaction net- 
works. For example, an edge weight can represent the number of times a person calls 
a friend or the number of packets transmitted on an Internet link. For convenience, we 
define for each node i the set of incident weights W(i) = {w(e) : e £ E(i)}. A node 
strength s(i) = YlwewU) w G) is the summation of weights incident to a node v. 

Before designing a measure that generalizes the traditional node degree so that 
it captures the disparity of interactions among neighbors, one needs to clearly define 
properties that this new measure needs to satisfy. Let r(i) be a generalized degree of 
node i. The first property of r(i) is preserving the maximum traditional degree: if all 
weights incident to a node i are equal, then the generalized degree of node i, r(i), is 
maximum and equal to the traditional (discrete) degree of node i, i.e r(i) = k(i). The 
second property is preserving the minimum traditional degree: if all edges incident to 
node i have weights that are almost zero except one edge that has weight much larger 
than zero (i.e. node i interacts primarily with one neighbor) then the generalized degree 
of node i should be very close to 1. 

The third and final property is the consistent handling of disparity: the partial order 
imposed by the generalized measure on any two nodes need to be consistent with the 
previous two properties. Intuitively, this means the more equal the weights are, the 
higher their generalized degree should be. Many partial ordering are possible, but here 
we define what we believe is the minimum requirement. If two nodes i and j have the 
same number of neighbors n, the same strength, and have n — 2 common weights, then 
the generalized degree should be inversely proportional to difference between the un- 
common weights of each node. For example, suppose W^(ul) = {5, 5, 5, 5}, W(v2) = 
{9,5,5,1},W(«3) = {9, 8, 2, 1} and W(v4) = {20 - 3e, e, e, e}. The third prop- 
erty then require the generalized degree measure r to impose the following ordering: 
r(vl) > r(v2) > r(v3). All three properties require that k(vl) — r(vl) > r(v2) > 
r(v3) > r(v4) i=s 1. The following axioms formalize the three properties that a gener- 
alized degree measure r needs to satisfy. 

1. Preserving maximum degree: r(i) = k(i) = \E(i)\ iff W{i) = W max (i), 
where W max {i) = {w; : w t = j^l < I < k(i)}. 

2. Preserving minimum degree: r(i) is close to 1 iff 3u : w(u) » eiv =/= u : 
w(v) = e. 

3. Consistent disparity: Vi, j such that k(i) — k(j) — n,s(i) — s(j), \W(i)f]W(j) 
n - 2, {wa,wa} = w\i) - W(j), and { Wj i,w j2 } = W(j) - W{i): if 
\wn —wa\ < \wji - Wj2\ then r(i) > r(j). 

Some of the previous work used a cutoff weight-threshold in order to either include 
or exclude a weighted edge and then computed the degree distribution normally @|9]. 
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Such an approach, however, does not properly handle the disparity of interaction among 
neighbors, but rather approximates a weighted network with an unweighted network. 

Surveying all network measures that were proposed to analyze weighted networks 
is beyond the scope of this paper. Instead, we focus on a sample of these measures 
that are mostly related to our contribution (interested reader may refer to survey papers 
on the subject such as 0). The weight distribution P(w) is similar to the degree 
distribution except that it measures the frequency of a particular edge weight. This 
measure neither generalizes the degree distribution nor does it capture the disparity in 
interaction between an individual node and its neighbors. 

The strength of a node becomes identical to the node's degree if all weights are 
equal to 1. The strength measure, however, fails to capture the disparity of interac- 
tion between an individual node and its neighbors (the consistency axiom). A more 
recent work [ 10 1 analyzed a graph's total weight, X^ees w ( e )' against the graph's total 
number of edges, \E\, over time. That work also analyzed the degree of a node, k(v), 
against the node's strength, s(v). These measures again fail to capture the disparity in 
interaction between a node and its neighbors. 

The network measure Y(v) — J2eeE(v) ( s^uf) successfully captures the dispar- 
ity of interaction within a node v [12 1. However, the Y measure is not a generalization 
of the degree measure as it fails to satisfy the first two axioms. 

An interesting method for generalizing unweighted network measures (including 
the node degree) to weighted networks is generating an ensemble of unweighted net- 
works that are sampled from the original weighted network JT|. The underlying as- 
sumption the method is that the weight of an edge reflects the probability of generating 
the edge in a sample network. The effective node degree is then the average over the 
samples. While the ensemble approach satisfy the first two axioms, it fails to satisfy 
the third axiom, the consistency in handling disparity. The following section presents 
our proposed measure, the continuous degree, and proves that it satisfies all the three 
axioms of a generalized node degree. 



3 The Continuous Degree, C-degree 

The inherent problem with the degree is it being discrete. A neighbor is either counted 
in the degree or not. We propose a generalization of the degree measure that takes edge 
weights into account. 

Definition 1 The C-degree of a node v in a network is r(v), where 

{0 ifv is disconnected 

2^eesw ~ log 2 —) otherwise 

Where s(v) = J2eeE(v) w i e ) is the strength of node v. Intuitively, the quantity 
represents the probability of an interaction over an edge e. The set j : e € E(v)\ 

is the interaction probability distribution for node v. The quantity Y^ e eE{v) 7{v) ^ §2 
is then the entropy of the interaction probability distribution, or how many bits are 
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needed to encode the interaction probability distribution. The entropy quantifies the 
disparity in the interaction distribution: the more uniform the interaction distribution 
is, the higher the entropy and vice versaQ The purpose of the power 2 is to convert the 
entropy back to the number of neighbors that are effectively being used. 

Figure Q~]compares the continuous degree distribution to the (discrete) degree distri- 
bution in a simple weighted network of four nodes. A node on the boundary has an out 
degree of 1, while an internal node has an out degree of 2. Intuitively, however, only 
one of the internal nodes is fully utilizing its degree of 2 (the one to the left), while 
the other node (to the right) is mostly using one neighbor only. The C-degree measure 
captures this and shows that only one internal node has a C-degree of 2 while the other 
internal node has a C-degree of 1.38. In the remainder of this section we prove that the 
C-degree satisfies all the three properties defined earlier. 




(a) network of four nodes, where k is the out-degree of a node 
and r is the continuous out-degree of a node. 



P(k)1 
2 



(b)The degree distribution 
of the network in (a) 



P(r)' 

2 
1 
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(c)The continuous degree 
distribution of the network in (a) 



Figure 1 : Continuous vs discrete degree distributions. 

In the remainder of this section we prove that the C-degree satisfies all the three 
properties defined earlier. 

Theorem 2 The C-degree measure satisfies all the three axioms of a generalized node 
degree. 

Proof The proof directly follows from the following three lemmas. 
Lemma 3 The C-degree satisfies the minimum degree axiom. 

Proof When all weights are close to zero except only one weight that is much bigger 
than zero, then the entropy (the exponent of the C-degree) is close to zero, and therefore 
the C-degree is close to 1 . 

Lemma 4 The C-degree satisfies the maximum degree axiom. 

1 This is in contrast to the Y measure, which decreases if the interaction distribution becomes more uni- 
form. 
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We then have 



Proof Under uniform interaction, all the weights incident to a node v are equal to a 
constant W v . Therefore 

ij j r „, , w(e) W v 1 

v „ eV , eeE( „ ): ^ = __ = _ 

\/v : r(v) = 2^= e - B <"> S2 ^ 
_ 2^ fc (^) fc^j iog 2 (fe(f)) 
_ 2iog 2 (fe(")) 
= k(v) 

In other words, both the degree and the C-degree of a node become equivalent under 
uniform interaction. The C-degree is also maximum in this case, because the exponent 
is the entropy of the interaction distribution, which is maximum when the interaction 
is uniform over edges. 

Lemma 5 The C-degree satisfies the consistent disparity property. 

Proof Let i,j be two nodes such that k(i) — k(j) — n,s(i) = s(j) = s, \ W(i)f]W(j)\ = 
n - 2,{wa,w i2 } = W(i) - W(j),{wji,w j2 } = W(j) - W(i), and \w a - w l2 \ < 
\iVji — Wj2 1 ■ Without loss of generality, we can assume that Wn > Wi 2 and Wji > wj 2 , 
therefore wn — Wi 2 < Wji — Wj 2 . We also have 



Wjl + W i2 _ ^ _ ™ _ w jl + W J2 _ c 

weW(i)r\W(j) S 



, therefore 

'i 

s s ~ 2 ~ s s 

Then from Lemma [6] 



Wjl Wn C Wn Wjl 

> > - > c > c 



Mc,— )>h(c,^) 
s s 

Wil. ,Wn Wn , Wn Wji ,Wji Wjl Wjl 

h( — ) - (c )lg(c ) > — -lg(-) - (c - —)lg{c - 

s s s s s s s s 

Therefore H(i) > H(j), because the rest of the entropy terms (corresponding to 
W(i) P| W(j) are equal, and consequently r(i) > r(j). 

Lemma 6 The quantity h(c, x) = —xlg(x) — (c — x)lg(c — x) is symmetric around 
and maximized at x — | for c > x > 0. 



Proof Symmetry: h(c,§ + 5) = -(§ +5)lg(§ + 6) - (f - S)lg{§ - 6) = h(c,§-5). 

h(c,2 
dx 



Maximum: h(c, x) is maximized when dh( £> x ) = fj = — 1 - Igx + 1 + lg(c — x), 



therefore x = c — x = 
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Theorem [2] means that, in the simple case of uniformly distributed utilization of 
connections, the continuous degree distribution preserves well-known properties of the 
discrete degree distribution, such as the power law. It remains, however, to investigate 
what laws the continuous degree distribution follows in the more general setting of 
interaction networks. In the following section we use the C-degree measure to analyze 
four real- world weighted networks. 

4 Case Studies 

We have analyzed four real world weighted networks that capture coauthorships be- 
tween scientists. Three of which were extracted from preprints on the E-Print Archive 
IfTTI : condensed matter (updated version of the original dataset that includes data be- 
tween Jan 1, 1995 and March 31, 2005), astrophysics, and high-energy theory. The 
fourth network represents coauthorship of scientists in network theory and experiment 

E2. 

It was shown that the degree distribution of many real networks follows the power 
law OH). A degree distribution follows the power law if P(k) oc fc~ 7 , where 7 is a 
constant. Figure [2]displays the C-degree distribution (CDD) and the (discrete) degree 
distribution (DD) for the four collaboration network. The figure uses log-log scale with 
the power law fit (based on [7 El). The CDD preserves the power-law behavior, but with 
steeper decline, which is consistent with Lemma|4] 

One would expect that as the degree of a node increases, the node will interact 
primarily with a smaller subset of neighbors. To verify this intuition, we define the 
degree utilization metric as the ratio between the C-degree and the degree of a node: 
u{v) = The degree utilization metric captures the percentage of links that a node 
uses effectively, therefore we expect the degree utilization to decrease as the degree 
increases. FigureOplots the degree utilization against the (discrete) degree for the four 
collaboration networks. A common pattern emerges in the four networks. For low 
degrees, the degree utilization is relatively high (a node with few links makes the best 
of them). For node degree greater than some constant the bias towards high degree 
utilization disappears. However, and to our surprise, a cone is observed, which starts 
wide at low degrees and gets narrower as the degree increases (the average degree 
utilization is plotted as a line in the figure). 

5 Conclusion 

We introduced in this paper a new measure for analyzing weighted networks: the C- 
degree. We proved that our measure is a continuous generalization of the discrete 
degree, and therefore bridges the gap between the analysis using the degree in un- 
weighted networks and weighted networks, where weights quantify interaction among 
nodes. We demonstrated the applicability of this new measure by analyzing four real- 
world weighted networks. We showed that the C-degree distribution follows the power 

2 Available through http://www-personal.umich.edu~mejn/netdata/ 

3 Source code available from http://www.santafe.edu/aaronc/powerlaws/ 
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Figure 2: Comparing the DD with CDD for the collaboration networks. 



law, but with a steeper power-coefficient. We also investigated the ratio between the 
C-degree and the traditional degree and showed that the on average it is lower bounded, 
even for nodes with high-degree. 
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Figure 3: Scatter plot of a node degree against its degree utilization for the four collaboration 
networks, the average utilization per degree is also plotted. 
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